GenNext: A Consolidated Domain Adaptable NLG System
نویسندگان
چکیده
We introduce GenNext, an NLG system designed specifically to adapt quickly and easily to different domains. Given a domain corpus of historical texts, GenNext allows the user to generate a template bank organized by semantic concept via derived discourse representation structures in conjunction with general and domain-specific entity tags. Based on various features collected from the training corpus, the system statistically learns template representations and document structure and produces well–formed texts (as evaluated by crowdsourced and expert evaluations). In addition to domain adaptation, GenNext’s hybrid approach significantly reduces complexity as compared to traditional NLG systems by relying on templates (consolidating micro-planning and surface realization) and minimizing the need for domain experts. In this description, we provide details of GenNext’s theoretical perspective, architecture and evaluations of output.
منابع مشابه
Toward an NLG System for Bantu languages: first steps with Runyankore (demo)
There are many domain-specific and language-specific NLG systems, which are possibly adaptable across related domains and languages. The languages in the Bantu language family have their own set of features distinct from other major groups, which therefore severely limits the options to bootstrap an NLG system from existing ones. We present here our first proof-of-concept application for knowle...
متن کاملCorpus-Driven Generation of Weather Forecasts
In traditional natural language generation (NLG), careful analysis of a corpus of example texts and determining the single correct sublanguage behind it is seen as one of the main tasks of the NLG system builder. In practice, this often means elimination of variation in the corpus and specification of conditions for rule application to the point where an NLG system becomes (virtually) determini...
متن کاملText Generation for Brazilian Portuguese: the Surface Realization Task
Despite the growing interest in NLP focused on the Brazilian Portuguese language in recent years, its obvious counterpart – Natural Language Generation (NLG) – remains in that case a little-explored research field. In this paper we describe preliminary results of a first project of this kind, addressing the issue of surface realization for Brazilian Portuguese. Our approach, which may be partic...
متن کاملDomain Adaptable Semantic Clustering in Statistical NLG
We present a hybrid natural language generation system that utilizes Discourse Representation Structures (DRSs) for statistically learning syntactic templates from a given domain of discourse in sentence “micro” planning. In particular, given a training corpus of target texts, we extract semantic predicates and domain general tags from each sentence and then organize the sentences using supervi...
متن کاملEffectiveness of GenNext framework on critical parameters of ERP implementation: a statistical comparison of traditional methodology and Gennext framework
Enterprise resource planning (ERP) implementations are known for high failure rates and crossing the defined budget and schedule. A new framework GenNext is introduced to arrest these issues. The objective of this paper is to validate the effectiveness of the framework and compare the benefits with respect to traditional methodology. Five projects were executed by the traditional methodology an...
متن کامل